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ABSTRACT 

The use of luminous red galaxies as cosmic chronometers provides us with an in- 



P> 1^ dispensable method of measuring the universal expansion rate H(z) in a model- 

O , independent way. Unlike many probes of the cosmological history, this approach 

jj^ • does not rely on integrated quantities, such as the luminosity distance, and therefore 

does not require the pre-assumption of any particular model, which may bias sub- 
sequent interpretations of the data. We employ three statistical tools - the Akaike, 
fvg , KuUback, and Bayes Information Criteria (AIC, KIC and BIC) - to compare the 



ACDM model and the R^ - ct Universe with the currently available measurements 
of H{z), and show that the 7?h = ct Universe is favored by these model selection 
^— V I criteria. The parameters in each model are individually optimized by maximum like- 

^ ' lihood estimation. The Rh - ct Universe fits the data with a reduced ;t'^^j - 0.745 for 

a Hubble constant i/o = 63.2 + 2.5 kms' Mpc', and //q is the sole parameter in this 
k> ' model. By comparison, the optimal ACDM model, which has three free parameters 

^ I (including Hq - 68.9 + 2.4 km s ' Mpc ' , D.,„ - 0.32, and a dark-energy equation of 

state pde - -pde), fits the H{z) data with a reduced ;if^^j, = 0.777. With these ;t'^^j val- 
ues, the AIC yields a likelihood of x 82 per cent that the distance-redshift relation of 
the Rh = ct Universe is closer to the correct cosmology, than is the case for ACDM. 
If the alternative BIC criterion is used, the respective Bayesian posterior probabili- 
ties are 91.2 per cent (Rh = ct) versus 8.8 per cent (ACDM). Using the concordance 
ACDM parameter values, rather than those obtained by fitting ACDM to the cosmic 
chronometer data, would further disfavor ACDM. 

Key words: cosmological parameters, cosmology: observations, cosmology: red- 
shift, cosmology: theory, distance scale, galaxies 



* John Woodraft' Simpson Fellow. E-mail: fmelia@email.aiizona.edu 
t E-mail: rsm@math.arizona.edu 



2 Fulvio Melia and Robert S. Maier 
1 INTRODUCTION 

The expansion of the Universe is now being studied by several methods, including observations 
of Type la SNe (Riess et al. 1998; Perlmutter et al. 1999), weak lensing (Refregier 2003), baryon 
acoustic oscillations (Seo & Eisenstein 2003; Eisenstein et al. 2005; Pritchard et al. 2007; Percival 
et al. 2007), and cluster counts (Haiman et al. 2000), among several others. Each of these methods 
presents its own set of difficulties, among them a dependence on integrated quantities, such as the 
luminosity distance which, however, is not independent of the assumed cosmology. It is therefore 
quite difficult to use the data for unbiased, comparative studies to test different expansion histories. 
This problem is particularly acute in the case of Type la SNe, where at least four 'nuisance' param- 
eters characterizing the standard candle must be optimized simultaneously with the model's free 
parameters, rendering the data compliant to the underlying cosmology (see, e.g., Melia 2012a). 

Even so, some progress has been made recently with attempts at comparing predictions of the 
7?h = ct Universe (Melia 2007; Melia & Shevchuk 2012) with the data, and with ACDM. The 
evidence thus far seems to suggest that the R^ = ct cosmology is a better match to the observations 
at high redshifts, particularly when it comes to the large-scale fluctuations of the cosmic microwave 
background (CMB), expressed through its angular correlation function and the apparent alignment 
of its quadrupole and octopole moments (for a summary of these comparisons, see Melia 2012a). 

In the local universe, these two models are virtually indistinguishable, e.g., in predicting a very 
similar luminosity distance all the way out to a redshift of 6 and beyond. Thus, given the problem 
of identifying model-independent data through Type la supernova observations, it is not easy to 
evaluate one model against the other on the basis of these measurements alone. However, some 
clarification begins to emerge beyond a redshift of 6, where the high-z quasars are now known 
to be accreting at, or near, their Eddington limit (see, e.g., Willott et al. 2010a,b; De Rosa et al. 
201 1). We showed recently that a Hubble Diagram (HD) constructed from these sources reveals a 
cosmic expansion fully consistent with the R^ = ct Universe, assuming a current value of 69 + 4 
km s"^ Mpc"^ for the Hubble constant Ho (Melia 2012b). Interestingly, ACDM can also fit the 
high-z quasar HD, but only for a very specific set of parameters, including a matter energy density 
Q.m = 0.27, scaled to its current critical value. But whereas the R^ = ct Universe has only one 
free parameter - the Hubble constant - the standard model has as many as six (depending on how 
one parametrizes the dark-energy equation of state Wde = Pde/Pde) - including Hq, Q.^, and w^e- 
The implication of this is that the optimization of ACDM simply forces it to relax to the R^ = ct 
expansion profile, which is more robust. 
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Moreover, though the distance-redshift relationship is essentially the same in these two models 
(even out to z > 6-7), the age-redshift relationship is not. In fact, these same high-z quasars 
present a seemingly insurmountable problem for ACDM because they suggest that ~ 10^ M© 
supermassive black holes appeared only 700-900 Myr after the big bang. Instead, in R^ = ct, 
their emergence at redshift ~ 6 corresponds to a cosmic age of > 1.6 Gyr. This was enough time 
for them to begin growing from ~ 5 - 20 Mq seeds (presumably the remnants of Pop II and III 
supemovae) at z < 15 (i.e., after the onset of re-ionization) and still reach a billion solar masses 
by z ~ 6 via standard, Eddington-limited accretion (Melia 2013). 

This kind of tangible result suggests that the 7?h = ct Universe relieves the growing tension be- 
tween ACDM and the observations, but it would be highly beneficial for us to find a way of testing 
this cosmology - and quantifying its superiority over ACDM - by exploiting model-independent 
data in the nearby Universe. The purpose of this paper is to demonstrate that the use of luminous 
red galaxies as cosmic chronometers (Jimenez & Loeb 2002) can do just that. We shall show that 
over the redshift range < z < 1.8, the measured Hubble constant H{z) is fitted better by the 
7?h = ct model than by ACDM; and especially so, if one takes account of the reduction in the 
number of free parameters. Unlike other indicators that rely on the expansion history of the Uni- 
verse, the cosmic chronometers may therefore offer us the best evidence yet that R^ = ct is to be 
preferred over ACDM. 

We introduce the cosmic chronometers in § 2, and in § 3 discuss the AIC, KIC and BIC tools 
we use to test ACDM and the R^ = ct Universe against these data. The results of our comparison 
between ACDM and R^ = ct are presented in § 4 and discussed in § 5. We conclude in § 6 with a 
discussion of future prospects. 

2 THE COSMIC CHRONOMETERS 

Cosmic chronometers offer us the possibility of measuring the differential age of the Universe, 
circumventing the limitations associated with the use of integrated histories, by directly measuring 
the derivative dt/dz, which represents the change in cosmic time as a function of redshift. And 
since H{z) = a/a, in terms of the expansion factor a{t), a measurement of dt/dz directly yields the 
expansion rate, because 

«fe) = ^ = -^|. (1) 

a I + zdt 

For various reasons, the best cosmic chronometers appear to be galaxies that are evolving 

passively on a time-scale much longer than their age difference. Observations indicate that the 
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most massive galaxies contain the oldest stellar populations up to redshifts z ~ 1 - 2 (Dunlop 
et al. 1996; Spinrad et al. 1997; Cowie et al. 1999; Heavens et al. 2004; Thomas et al. 2005). Less 
than 1 per cent of the stellar mass in these massive galaxies formed at z < 1 (Heavens et al. 2004; 
Panter et al. 2007). In high-density regions (i.e., galaxy clusters), star formation ceased by redshift 
z ~ 3 (Thomas et al. 2005), and other massive systems - those with stellar masses > 5 x 10'^ Mq 
- finished their star formation activity by z ~ 2 (Treu et al. 2005). 

The empirical evidence therefore suggests that galaxies in the highest density regions of clus- 
ters formed their stellar content at z > 2, and have been evolving passively since that time, without 
any additional episodes of star formation. One can therefore view these galaxies as tracing the 'red 
envelope,' hosting the oldest stars in the Universe at every redshift. Thus, given their viability as 
cosmic chronometers, a great deal of effort is being expended to calculate d?/dz - and therefore 
H{z) - using their measured properties (see, e.g.. Stem et al. 2010; Stem et al. 2012; Moresco et al. 
2012a,b). 

For example, one of the most direct ways of determining the age of the galaxy is to use the 
4000 A break in its spectmm, which depends linearly on age for old stellar populations (Moresco 
et al. 201 1). This break is a discontinuity of the spectral continuum due to metal absorption lines 
whose amplitude correlates linearly with the age and metal abundance. If the metallicity is known, 
then the difference in age between two galaxies is proportional to the difference in their 4000 A 
amplitudes. 

However, one must also be aware of the fact that many systematic sources of uncertainty can 
bias this kind of analysis (see, e.g., Moresco et al. 2012b). These include: (1) the degeneracy be- 
tween the effect of a change in age and an effect due to a change in stellar metallicity or the star 
formation history; (2) the possible biasing of the estimate of H{z) by the choice of stellar popu- 
lation synthesis model, used to estimate the age or calibrate the 4000 A versus age relation; and 
(3) the possible existence of a progenitor bias (van Dokkum & Franx 1996), in which high-redshift 
samples of early-type galaxies might not be statistically equivalent to those at low redshifts. 

These caveats notwithstanding, one is none the less encouraged by the agreement seen between 
the results of several different approaches. The data set shown in figure 1, including both H{z) 
measurements and error bars, was assembled from the compilations of Simon et al. (2005), Stem 
et al. (2010), and Moresco et al. (2012a), and spans the redshift range < z < 1.8. Together, these 
compilations paint a fairly consistent picture of the universal expansion, particularly when viewed 
in terms of the theoretical expectations, which we shall consider shortly, following our discussion 
of model selection statistics. 
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3 MODEL SELECTION STATISTICS 

To compare the evidence for and against competing models, such as models of the distance- 
redshift relationship, the use of the Akaike Information Criterion (AIC) is now common in cos- 
mology (see, e.g., Takeuchi 2000; Liddle 2004, 2007; Tan & Biswas 2012). The AIC can be viewed 
as an enhanced 'goodness of fit' criterion, which extends the usual ;^f^ criterion by taking account 
of the number of parameters in each model. It prefers models with few parameters to those with 
many, unless the latter provide a substantially better fit to the data. This reduces the possibility of 
overfitting: the fact that by optimizing over a greater number of parameters, one may simply be 
fitting the noise. 

As developed (Akaike 1973; see also Bumham & Anderson 2002, 2004), the AIC provides the 
relative ranks of two or more competing models, and also a numerical measure of confidence that 
each model is the best. These confidences are analogous to likelihoods or posterior probabilities in 
traditional statistical inference. But unlike traditional inference methods, the AIC can be applied 
to models that are not 'nested.' Comparing a pair of models that are nested, in the sense that one is 
a specialization of the other, is straightforward: after fitting each model to the data, one computes 
the;^:'^ per degree of freedom for each, and decides which is a better fit. One can also calculate (say, 
by applying an F-test) a likelihood that the simpler model should be rejected, or the likelihood of 
the null hypothesis that the simpler model is a better approximation to the 'true' one. By exploiting 
the AIC one can generalize this procedure: one can compare a pair of models, neither of which is 
a specialization of the other; such as ACDM and an altemative model. 

The AIC can be applied after regression of the following kind is performed. Suppose that for 
values z\, ■ ■ ■,Zn of an independent variable there are measured values /?!,...,/?„ of a dependent 
one, with (known) error bars +cri, . . . , ±cr„; and suppose the errors are normally distributed. Sup- 
pose that a model A\ predicts values hi, . . ., h„, computed from a formula hi = hiifi) that involves 
a parameter vector fi comprising k unknown parameters, i.e., fi = (J3i,... ,/3k). That is, the data 
model M is really a statistical one, of the form 

hi = hiifi) + (TiZi , (2) 

where Zi, . . . ,Z„ are independent standard normal random variables. (In the case of linear re- 
gression, hi(J3) would be 2;=i ^ijfij for known coefficients Xif, typically, Xij = U^\zi) for known 
functions U^\ . . . , W'' of z.) 
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For model M, i\^QX^ goodness of its fit to tlie data is given by 

n 

x' = Y}^i-hi0)flcr], (3) 

1=1 
i.e., a (weighted) sum of squared errors, and the reduced x^ (i-e-^ the;^f^ per degree of freedom) by 

xlf=X^I(n-k). (4) 

(It is assumed that n > k.) The parameters (/3i, . . . ,/3k) are chosen to minimize the;^^, yielding the 
best fit to the data. The AIC for the resulting fitted model is then given by 

AlC=x^ + 2k. (5) 

If there are two or more competing models for the data, Mi, . . . , Mn, and they have been sepa- 
rately fitted, the one with the least resulting AIC is assessed as the one most likely to be nearest to 
the 'truth,' i.e., to the unknown model Al* that generated the data. A more quantitative ranking of 
models can be computed as follows. If AICq, comes from model Ma, the unnormalized likelihood 
that Ma is closest to the truth is the 'Akaike weight' exp(-AICtt/2). Informally, Ma has likelihood 

..jy. . ^ exp(-AICJ2) 

^ exp(-AICi/2) + --- + exp(-AICW2) ^^ 

of being the best choice. (The 2's here could of course be omitted by redefining AIC, but the 

normalization implicit in Equation (5) is traditional.) In the case of a pair of models Mi,M2, the 
difference AIC2 - AICi determines the extent to which Mi is favored over M2- 

It is clear that the 2k term in Equation (5), proportional to the parameter count k, exponentially 
disfavors models with too many parameters, though such models can be favored if they do a much 
better job of fitting the data. The choice of proportionality constant (i.e., 2) is not entirely arbitrary, 
being based on an argument from information theory that has close ties to statistical mechanics. 
The following is a brief summary, with many more details to be found in the statistics literature. 
(The reader should note that most of the literature focuses on the case when the error variances 
crl, . . .,crl are both unknown and equal to some common variance cr^, a nuisance parameter that 
must be estimated as part of the fitting process; the setup given in Equations 2-3 is actually sim- 
pler.) 

Any two statistical models of the data set (hi,. . . , h„), such as a 'true' model Al* and another 
model M, can be viewed as probability density functions (PDF's) on R", say /*(/ii, . . .,hn) and 
f{hi, . . . , h„), respectively. In information theory one says that the discrepancy of the PDF / from 
the PDF /«, which is a measure of distance, is given by the KuUback-Leibler formula 

D{MA\M) = [ dhi... dK Mh) In ^tt: > (7) 

(where in an obvious notation, the argument h stands for the entire data set [hi, . . . ,/?„]). In a 
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thermodynamic interpretation this is a relative entropy in the sense of Boltzmann and Hasenohrl. 
To select the best model AX from a set of candidate models, one would choose the one with the 
minimum D(A1h.||A1). Of course M* is not known, so this cannot be done literally. But the case 
when A1 is a parametrized model, and its parameters are chosen (by minimizing x^) to fit a data 
set generated by Al*, is special. It can be shown that the AIC of the fitted model A1 is a good 
approximation to 2D{A\J\\A\), up to an unimportant additive constant. This is especially the case 
when Al* is a model of the same type, with unknown parameters {fi\, . . . ,ySp. 

Specifically, AIC/2 is an unbiased estimator of the distance D(A1h.||A1): exactly so for linear 
regression, and to leading order for non-linear regression . The phrase 'unbiased estimator' means 
that on average the two are the same, where the averaging is over data sets generated by M*, 
with PDF /*. Of course the fitted model M depends on the data set, so in the context of Al*, both 
Z)(AlH.i|Al) and AIC/2 are random variables. In probabilistic language, the lack of bias means that 
they have the same expectation. 

The extent to which the fitted AIC is an accurate estimate of 2D(A1h.||A1), data set by data set, 
as well as being the same on average, has been investigated theoretically (Yanagihara & Ohmoto 
2005). Its variability has also been studied empirically; for example, by repeatedly comparing 
ACDM to other cosmological models on the basis of data sets generated by a bootstrap method 
(Tan & Biswas 2012). It is known that the AIC is increasingly accurate when n is large, but it is 
felt that for all n, the magnitude of the difference A = AIC2 - AICi should provide a numerical 
assessment of the evidence that model 1 is to be preferred over model 2. A rule of thumb that has 
been used in the literature is that if A < 2, the evidence is weak; if A « 3 or 4, it is mildly strong; 
and if A > 5, it is quite strong. 

Besides using fixed thresholds, one can weight each candidate model in a Boltzmann-like way 
by its Akaike weight, i.e., according to Equation (6). For each model Ma, the likelihood £,{Ma), 
which is determined by the differences between AICq, and the AIC's of the other model(s), is 
loosely analogous to a posterior probability in statistical inference, despite its not being computed 
by a Bayesian procedure (no Bayesian prior is involved). But in the absence of a general theory 
of AIC variability, deciding between models 1 and 2 cannot be viewed as a hypothesis test, at any 
fixed level of significance such as 0.05. 

Several alternatives to the AIC have been considered in the literature. A lesser-known one 
arises as follows. The discrepancy D(A1h.||A1) is not symmetric in the PDF's /*,/, and it has 
been suggested that it should be replaced by a symmetrized version, which is arguably a better 
tool for distinguishing between data models (Cavanaugh 1999). The unbiased estimator for the 
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symmetrized version has been given the name KIC (KuUback Information Criterion), and is given 
by 

KlC=x^ + ^k. (8) 

The KIC, with k multiplied by the coefficient 3 rather than 2, disfavors overfitting more than does 
the AIC, and has been shown to perform favorably against the AIC as a tool for model selection 
(Cavanaugh 2004). It has long been felt (since, e.g., Bhansali & Downham 1977) that the problem 
of overfitting may be best dealt with by choosing a coefficient that is larger than 2, and perhaps 
even than 3. But the AIC and KIC are the only such schemes that follow readily from information 
theory. 

A better-known alternative to the AIC is the BIC (Bayes Information Criterion), which is a 
misnomer in that it is not based on information theory, but rather on an asymptotic (n -^ oo) ap- 
proximation to the outcome of a conventional Bayesian inference procedure for deciding between 
models (Schwarz 1978). It is defined by 

mC=x^ + (}nn)k, (9) 

and suppresses overfitting very strongly if n is large. Liddle et al. (2006) and Liddle (2007) make 
the case for using BIC in cosmological model selection, and it has now been used to compare 
several popular models against ACDM (see, e.g., Shi et al. 2012). However, it should be noted that 
the monograph of Bumham & Anderson (2002), which popularized Equation (6) for assigning 
AlC-based likelihoods to models, strongly prefers AIC to BIC as a tool for model selection. They 
elsewhere note that the AIC can in fact be interpreted in Bayesian terms, as being the consequence 
of imposing a nonuniform but reasonable choice of prior distribution on the set of candidate models 
(Bumham & Anderson 2004). Kuha (2004) draws further analogies between AIC and BIC, and 
argues that they are both valuable tools. 

In the comparison below, we employ the AIC, KIC, and BIC. We do not employ the so-called 
corrected AIC, denoted AICc, which includes a correction term intended to remove bias when n is 
small (Bumham & Anderson 2002). The correction term is small (cf. Tan & Biswas 2012). More 
importantly, the form of this term is appropriate only for data sets without explicit error bars, with 
the common error variance cr^ estimated as part of the fitting process (Maier 2013, in preparation). 

4 A COMPARISON BETWEEN ACDM AND R^ = CT 

The 7?h = ct Universe is a flat Friedmann-Robertson-Walker (FRW) cosmology that strictly ad- 
heres to the constraints imposed by the simultaneous application of the Cosmological principle 
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and Weyl's postulate (Melia, 2007; Melia & Shevchuk 2012; Melia 2012a). When these ingredi- 
ents are applied to the cosmological expansion, the gravitational horizon R^ = c/H must always 
be equal to ct. This cosmology is therefore very simple, because a{t) oc t, which also means that 
l+z = l/t, with the (standard) normalization that a{t()) = 1. Therefore in the R^ = ct Universe, we 
have the straightforward scaling 

H{z) = (1 + z)Ho . (10) 

Notice, in particular, that the expansion rate H{z) in this model has only one free parameter. By 
comparison, ACDM has as many as six parameters (depending on the application), including Hq, 
the scaled matter energy density Q.,„ (= Pm/Pc, in terms of the matter energy density pm and the 
critical density pc = [3c^ /SnG]H^^), and the dark-energy equation of state w^e = Pde/Pde- 

In this paper, we shall take the minimalist approach and optimize ACDM using only these 
three free parameters. (Using additional parameters would weaken the statistical significance of 
the fit even further, so by selecting this minimal set, we present ACDM in its best possible light.) 
The Hubble constant in this cosmology is therefore given by 

H(Z) = Ho [Q^(1 + Z? + n,(l + Z)^ + ride(l + 2)3(1+"'''=)]'^' , (11) 

where Q.^ and Q^e for radiation and dark energy, respectively, are defined analogously to Q.^. In 
addition, we shall assume a flat ACDM cosmology, for which Q.^ + ^r + ^de = 1, thus avoiding 
the introduction of Q^e as an additional free parameter. Of course, Qr (~ 6 x 10"^) is known from 
the current temperature (« 2.7 K°) of the cosmic microwave background radiation. 

For each model Ma (with a = 1, 2 specifying the 7?h = ct Universe and ACDM, respectively), 
we optimize the fit by finding the model parameter vector ;8q, that minimizes the;^^ . Equivalently, 
we choose ;Sq, to maximize the joint likelihood function 

where the Hi are the measured values of the Hubble constant at redshift Zi, and the H{zi\/3a) are 
the corresponding theoretical values computed from the parameter vector yS^. For a = 1,2, the 
number of parameters (i.e., the length k of the vector yS^,) is respectively 1 and 3, as stated. The 
fitting is a linear regression in the case of the R^ = ct Universe and a non-linear one for ACDM, 
as is evident from Equations (10) and (11). The data set {(z,, //,, (T;)}"^j to which each model is 
fitted was assembled from the H{z) compilations of Moresco et al. (2012a), Stem et al. (2010), 
and Simon et al. (2005), and consists ofn= 19 measured values, each with an error bar. 

The results for the R^ = ct Universe (for which the best fit has Hq = 63.2 + 2.5 km s"^ Mpc"^) 
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Figure 1. Nineteen H(z) measurements, with error bars, and comparison with two theoretical models: (solid) the Sh = ct Universe, with its sole 
parameter Hq = 63.2 ± 2.5 km s"' Mpc"', and (dashed) the standard ACDM cosmology, assuming a flat Universe, with CI,,, = 0.32, SIa = 0.68, 
and Hq = 68.9 ± 2.4 km s"' Mpc"'. The reduced ;t'^^j (with 18 degrees of freedom) for the Sh = c/ fit is 0.745. The corresponding value for the 
optimal ACDM model (with 16 degrees of freedom) is^j^^j = 0.777. 



and ACDM (for which it has Ho = 68.9 + 2.4 km s~' Mpc"\ Q^ = 0.32, and Wje = -1) are 
shown in figure 1. (These Hq values are quoted with one-sigma standard errors, calculated from 
the corresponding ;^f^-distribution for each model.) With 19-1 = 18 degrees of freedom, the 
reduced ;(f^^j for the Rh = ct Universe is 0.745. By comparison, the optimal ACDM fit has 19-3 
= 16 degrees of freedom, and a corresponding reduced x\oi ~ 0.777. Even by eye, one can see that 
7?h = c? is a better fit to the data at z > 0.9. The reduced ;(f^ overall suggests that i?h = ct is at least 
as good as ACDM; especially, when its having only one parameter is taken into account. We shall 
see shortly that on statistical grounds, the Rh = ct distance-redshift predictions are in fact more 
likely than those of ACDM to be closer to the correct cosmology. 

It is worth pointing out that the ACDM model optimized for the cosmic chronometer data alone 
is quite different from the concordance model, characterized by the parameter values H^ = 73.8 + 
2.4 km s ' Mpc ', Qm = 0.27 and Wde = -1- Using the concordance ACDM parameter values to 
fit the cosmic chronometer data yields ;^f^^^ = 0.9567, which is acceptable (since ;^f^^^ < 1.0), but 
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which indicates a noticeably less good fit than both the fits shown in figure 1 . By comparing the 
Rh = ct Universe against the optimized ACDM, rather than against the concordance model, we are 
once again presenting ACDM in its best possible light. 

With n = \9 data points and k = \ parameter, the AIC for the optimized R^ = ct Universe is 
AICi = 15.41. For the optimized ACDM, with k = 3, the corresponding value is AIC2 = 18.432. 
The magnitude of the difference A = AIC2 - AICi, namely A = 3.022, indicates that Mi is to be 
preferred over Ali- According to Equation (6), the likelihood ofR^ = ct (i.e., AI1) being the correct 
choice is X(A1i) = 82 per cent. For ACDM (i.e., M2), the corresponding value is JliM^) = 18 per 
cent. 

If one uses the KIC and BIC statistics instead of the AIC, but continues to weight the models as 
in Equation (6), the difference is greater, since k is multiplied by 3 in the former, and by In n « 2.9 
in the latter. One finds that KICi = 16.41 and KIC2 = 21.434, yielding £{Mi) = 92.4 per cent 
and JliMz) = 7.6 per cent for 7?h = ct and ACDM, respectively. And for BIC, the results are 
BICi = 16.35 and BIC2 = 21.27, yielding X(Mi) = 91.2 per cent and £(M2) = 8.8 per cent. 

According to all three statistics, the predictions of R^ = ct are more likely than those of ACDM 
to be closer to the correct cosmology. This is notably the case for BIC, for which there is an 
accepted interpretation of the magnitude of A = BIC2 - BICi in terms of the strength of the 
evidence against model 2 (Kass & Raftery 1995; Tan & Biswas 2012). If, as here, A = 4.92, 
the evidence against model 2 (i.e., ACDM) would be judged 'positive' (the positive range for A 
extends from 2 to 6, at which point the 'strong' range begins). 



5 DISCUSSION 

Though a measurement of the cosmic expansion rate using early type galaxies is subject to several 
possible systematic errors, the fact that the inferred values of H{z) are model-independent makes 
this a highly desirable and meaningful approach for testing different cosmological models. In 
this paper, we have compared the fits to a data sample drawn from several sources, and have 
demonstrated that the R^ = ct Universe is more likely than ACDM to account for the observed H 
versus z profile. In addition, the inferred value of the Hubble constant Hq is consistent with the rate 
(69 + 4 km s"' Mpc"') emerging from a fit to the high-z quasar Hubble Diagram (Melia 2012b). 

This is rather impressive, given that the former corresponds to a probe of the local Universe (at 
z < 2), whereas the latter concerns the cosmic expansion at high redshift (z > 6). In addition, one 
should not underestimate the fact that in R^ = ct, there is only one free parameter. By comparison, 
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the standard model of cosmology, with as many as six, depending on how one parametrizes the 
dark-energy equation of state, fails to account for the appearance of high-z quasars at redshift > 6 
(Melia 2013). This type of comparative analysis therefore supports the suggestion that i?h = ct is 
closer to the correct cosmology than is ACDM. The growing tension between the predictions of 
ACDM and the ever improving cosmological data also suggests that the current standard model 
may be a useful approximation, but will probably not endure in the long run. 

Recently, however, some criticism has been leveled at the i?h = ct cosmology on the basis of 
several claimed inconsistencies, some theoretical, others observational (Bilicki and Seikel 2012). 
One of the observational arguments was based on the same cosmic chronometer data we have ad- 
dressed in this paper, from which a different conclusion was arrived at from the one obtained above. 
However, these earlier results are incorrect: simply, because they were not based on a proper sta- 
tistical analysis. Those conclusions appear to have been based on a qualitative inspection by eye. 
But clearly, the results presented here show that such an approach does not stand up well to a 
quantitative assessment based on comparisons of likelihoods. And since the cosmic chronometer 
data favor the R^ = ct cosmology over ACDM when using a simple, direct statistical compari- 
son, any higher-order metric employed with the H(z) data, particularly those designed to test the 
parametrization of ACDM, e.g., the decomposition of density into the three specific components, 
matter, radiation, and dark energy, cannot be used to meaningfully constrain the R^ = ct Uni- 
verse. On the contrary, as we have shown here, the cosmic chronometer data - when interpreted 
quantitatively - suggest that the 7?h = ct Universe is at least as good as the standard model. 

The second observational argument for the criticism was based on the analysis of Type la SNe. 
However, here too the data were used incorrectly to arrive at an invalid result. The use of Type la 
supernova data ignored the fact that these were optimized for a pre-assumed ACDM cosmology. 
Therefore, these cannot be used for a comparative test between different expansion scenarios. A 
complete discussion of this problem has already been published in Melia (2012a), so we shall not 
reproduce it here. 

The danger of using data optimized for ACDM to test other cosmologies has also been high- 
lighted recently by an examination of the Gamma Ray Burst (GRB) Hubble Diagram (Wei et al. 
2013). In this work, the data were recalibrated separately for each model and, though the results 
are quite similar, a comparison of the reduced Xdoi^ ^°^ ^h = ct and ACDM shows that the data 
clearly favor the former over the latter. This result would not have been evident without a recal- 
ibration of the data using the R^ = ct expansion history. Given the preponderance of evidence, it 
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seems likely that when the Type la SN data are calibrated correctly for each cosmology, 7?h = ct 
will emerge as the more likely of the two to be correct. 

Finally, it is worth pointing out that the 'theoretical difficulties' invoked to argue against 
Rh = ct are based on a failure to comprehend fully Birkhoff 's theorem and its corollary, and the 
consequence of Weyl's postulate on Friedmann-Robertson-Walker (FRW) metrics. It is not the 
purpose of this paper to correct this misunderstanding, but since it appears to be an issue relevant 
to the interpretation of cosmic chronometer data, we shall address it here as well. 

Birkhoff 's theorem and its corollary (Birkhoff 1923) place no limit on scale, so Bilicki and 
Seikel's (2012) assertion that the definition of a Schwarzschild (i.e., a gravitational) radius makes 
no sense on cosmic dimensions is without foundation. Moreover, one does not 'define' a Schwarz- 
schild radius, as was claimed; this scale emerges automatically when one re-writes the metric in 
terms of observer-dependent coordinates versus the more commonly used co-moving coordinates 
(see, e.g., Melia & Abdelqader 2009). Many are perhaps not aware of the fact that exactly the 
same phenomenon occurs when writing the spacetime metric for compact objects. The distinction 
arises between a free-falling observer and the observer at a fixed radius (and hence accelerated) 
relative to the central mass. The former is not aware of the gravitational radius that emerges only 
when the metric is written using rulers and clocks fixed to the latter. In the cosmological context, 
we are free-falling observers when we write the FRW metric using co-moving coordinates. How- 
ever, a gravitational radius emerges when we re-write this metric in terms of an observer's fixed 
coordinates. 

The irony, of course, is that the gravitational radius in cosmology actually first appeared as 
far back as 1917, though its meaning was not then fully appreciated, de Sitter's (1917) paper 
on his now famous metric was originally written in terms of the observer's fixed coordinates, 
which included the gravitational radius, since the co-moving coordinates would be introduced by 
Friedmann only several years later. The argument against the validity of a gravitational horizon 
in cosmology would therefore imply that de Sitter space is meaningless on large scales. This is 
simply not true. 

And since the meaning and validity of the gravitational radius in cosmology (which, by the 
way, coincides with the better known Hubble radius) were not appreciated, the consequence of 
Weyl's postulate on its permitted rate of expansion was ignored. Since the Hubble radius is a 
'proper' radius, it has no choice but to expand at a constant rate, as demonstrated by Melia & 
Shevchuk (2012) and (in the more pedagogical treatment) by Melia (2012c). 
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6 FINAL REMARKS 

Euclid (Laureijs et al. 201 1) and BOSS (Eisenstein et al. 201 1) should provide thousands of passive 
galaxies at z > 0.5, which will significantly improve the accuracy of H{z) at these higher redshifts. 
In concert with this improved statistic, it will be essential to understand better if the systematic 
effects, e.g., the error due to the metallicity and star formation uncertainties, may be controlled 
and minimized. It is crucial to carry out this arduous work, because these cosmic chronometers 
are among the few sources that provide us with model-independent data. And as we have seen, 
only such model-independent data can truly distinguish between competing cosmologies. 

We end with a word of caution. It should be evident from the contents of this paper how 
important it is to use only model-independent data in any comparative analysis between competing 
cosmologies. In some cases, it is simply not possible to avoid the 'circularity problem,' in which 
a model must be pre-assumed in order to extract the data. This is certainly the case in the Type la 
SN work, but also when dealing with any observations requiring the use of integrated quantities, 
such as the luminosity distance. 

An entirely different approach sometimes used to determine H{z) is based on the identification 
of Baryon Acoustic Oscillations (BAO) and the Alcock-Paczynski distortion from galaxy cluster- 
ing. That is, instead of using information on how cosmic time changes with z (as is the case for 
the data we have used here), this alternative approach measures how 'standard rulers' evolve with 
redshift. The results of these two different methods are sometimes combined to produce an overall 
H versus z diagram, but there is a good reason to be wary of this procedure: whereas the cosmic 
chronometers produce model-independent data, the second approach must necessarily assume a 
particular cosmology and is therefore model-dependent (Blake et al. 2012). 

With the BAO method, the cosmic expansion is measured from the growth of structure as a 
function of redshift. Redshift-space distortions arise because the recession velocities of galaxies, 
from which distances are inferred, include contributions from both the Hubble flow and from the 
peculiar velocities driven by the clustering of matter (see, e.g., Hamilton 1998 for a review). The 
oscillations are modeled via the non-linear evolution of both the matter density and velocity fields, 
which are quite different between, say, ACDM and R^ = ct (Melia & Shevchuk 2012). In addition, 
to compute redshift space separations for each pair of galaxies given their angular coordinates and 
redshifts, one must adopt a cosmological model for the expansion to relate these quantities to each 
other. 

Unfortunately, this gives rise to a situation not unlike that currently existing with Type la SNe 
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(Melia 2012a), in which one must simultaneously optimize at least four nuisance parameters incor- 
porated into the description of the measurements, along with the free parameters of the assumed 
model. One must therefore avoid the use of such model-dependent data in any attempts to di- 
rectly compare fits using ACDM with those of other models, such as R^ = ct. Only the cosmic 
chronometer data are truly model-independent and therefore appropriate for this purpose. 
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